Performance of GCC- and AMDF-Based Time-Delay Estimation in Practical Reverberant Environments
نویسندگان
چکیده
Recently, there has been an increased interest in the use of the time-delay estimation (TDE) technique to locate and track acoustic sources in a reverberant environment. Typically, the delay estimate is obtained through identifying the extremumof the generalized cross-correlation (GCC) function or the average magnitude difference function (AMDF). These estimators are well studied and their statistical performance is well understood for single-path propagation situations. However, fewer efforts have been reported to show their performance behavior in real reverberation conditions. This paper reexamines the GCCand AMDF-based TDE techniques in real room reverberant and noisy environments. Our contribution is threefold. First, we propose a weighted crosscorrelation (WCC) estimator in which the GCC function is weighted by the reciprocal of AMDF. This new method can sharpen the peak of the GCC function, which corresponds to the true time delay and thus leads to a better estimation performance as compared to the conventional GCC estimator. Second, we propose a modified version of the AMDF (MAMDF) estimator in which the delay is determined by jointly considering the AMDF and the average magnitude sum function (AMSF). Third, we compare the performance of the GCC, AMDF, WCC, and MAMDF estimators in real reverberant and noisy environments. It is shown that the AMDF estimator can yield better performance in favorable noise conditions and is slightly more resilient to reverberation than the GCC method. The GCC approach, however, is found to outperform the AMDF method in strong noisy environments. Weighting the correlation function by the reciprocal of AMDF can improve the performance of the GCC estimator in reverberation conditions, yet its improvement in noisy environments is limited. TheMAMDF algorithm can enhance the AMDF estimator in both reverberant and noisy environments.
منابع مشابه
A Pitch-based Approach to Time-delay Estimation of Reverberant Speech
Generalized Cross-Correlation (GCC) has been the traditional method for estimating the relative time-delay associated with the speech signals received by a pair of microphones in a reverberant, noisy environment. The ltering criterion employed is either focussed on the signal degradations due to additive noise or those due exclusively to multipath channel eeects. There has been relatively littl...
متن کاملEstimation of fundamental frequency of reverberant speech by utilizing complex cepstrum analysis
This paper reports comparative evaluations of twelve typical methods of estimating fundamental frequency (F0) over huge speech-sound datasets in artificial reverberant environments. They involve several classic algorithms such as Cepstrum, AMDF, LPC, and modified autocorrelation algorithms. Other methods involve a few modern instantaneous amplitudeand/or frequency-based algorithms, such as STRA...
متن کاملSource Localization in Reverberant Environments : Part II - Statistical Analysis
The main di culty in building robust practical systems for acoustical source localization using microphone arrays, is the e ects of room-reverberation. In this paper, a statistical analysis is presented of the in uence of room reverberation on source localization techniques. Using a statistical reverberation model presented in a companion paper, the Cram erRao lower bound for time-deley estimat...
متن کاملPerformance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions
TDOA(time difference of arrival-) based algorithms are common methods for speech source localization. The generalized cross correlation (GCC) method is the most important approach for estimating TDOA between microphone pairs. The performance of this method significantly degrades in the presence of noise and reverberation. This paper addresses the problem of 3D localization in joint noisy and re...
متن کاملInteraural Time Difference Estimation Using Generalized Cross- correlation with Maximum Likelihood Weighting in Reverberant Environments
In this paper, an interaural time difference (ITD) estimation method is proposed for binaural speech separation in reverberant environments. First, the auditory signals are represented in the time-frequency (T-F) domain, and the ITD for each T-F bin is then estimated using generalized cross-correlation (GCC) with a maximum likelihood (ML) weighting function. In particular, the ML weighting func...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- EURASIP J. Adv. Sig. Proc.
دوره 2005 شماره
صفحات -
تاریخ انتشار 2005